Abstract

Background: Necrotizing enterocolitis (NEC) is a devastating intestinal disease that afflicts 10% of extremely 
preterm infants. The contribution of early intestinal colonization to NEC onset is not understood, and predictive
biomarkers to guide prevention are lacking. We analyzed banked stool and urine samples collected prior to disease
onset from infants <29 weeks gestational age, including 11 infants who developed NEC and 21 matched controls
who survived free of NEC. Stool bacterial communities were profiled by 16S rRNA gene sequencing. Urinary 
metabolomic profiles were assessed by NMR. 

Results: During postnatal days 4 to 9, samples from infants who later developed NEC tended towards lower alpha 
diversity (Chao1 index, P = 0.086) and lacked Propionibacterium (P = 0.009) compared to controls. Furthermore, NEC
was preceded by distinct forms of dysbiosis. During days 4 to 9, samples from four NEC cases were dominated by 
members of the Firmicutes (median relative abundance >99% versus <17% in the remaining NEC and controls, 
P < 0.001). During postnatal days 10 to 16, samples from the remaining NEC cases were dominated by Proteobacteria, 
specifically Enterobacteriaceae (median relative abundance >99% versus 38% in the other NEC cases and 84% in 
controls, P = 0.01). NEC preceded by Firmicutes dysbiosis occurred earlier (onset, days 7 to 21) than NEC preceded by 
Proteobacteria dysbiosis (onset, days 19 to 39). All NEC cases lacked Propionibacterium and were preceded by either 
Firmicutes (>98% relative abundance, days 4 to 9) or Proteobacteria (>90% relative abundance, days 10 to 16) 
dysbiosis, while only 25% of controls had this phenotype (predictive value 88%, P = 0.001). Analysis of days 4 to 9 urine 
samples found no metabolites associated with all NEC cases, but alanine was positively associated with NEC cases that 
were preceded by Firmicutes dysbiosis (P < 0.001) and histidine was inversely associated with NEC cases preceded by 
Proteobacteria dysbiosis (P = 0.013). A high urinary alanine:histidine ratio was associated with microbial characteristics 
(P < 0.001) and provided good prediction of overall NEC (predictive value 78%, P = 0.007). 

Conclusions: Early dysbiosis is strongly involved in the pathobiology of NEC. These striking findings require validation 
in larger studies but indicate that early microbial and metabolomic signatures may provide highly predictive 
biomarkers of NEC. 
Background

Preterm birth contributes disproportionately to the global 
burden of morbidity and mortality in infancy. Necrotizing 
enterocolitis (NEC) is a devastating emergency of preterm 
infants, which affects about 10% of infants born <29 weeks 
gestational age, with a case-fatality of about 30% [1,2]. 
NEC typically occurs without clinical warning between 3 
days and several months of postnatal life [3]. Risk factors 
for NEC include immaturity [4], timing and type of infant 
feeding [5,6], extended empirical use of antibiotics [7,8], 
and intestinal bacterial colonization [9-12]. Unfortunately, 
after decades of research, the specific pathogenesis of 
NEC remains an enigma, and validated biomarkers to 
identify individuals at highest risk of disease are lacking. 

Intestinal colonization has remained an important target 
of investigation in the etiology of NEC. The microbial 
community of preterm infants, compared to healthy, term 
infants, consists of dramatically fewer beneficial species, 
lower bacterial diversity, and more pathogens [9-12]. NEC 
does not occur in germ-free animals, lending credence to 
the importance of intestinal colonization to the develop- 
ment of NEC [13]. Several epidemiologic studies in pre- 
term infants report an association between early empirical 
antibiotic use and subsequently increased risk of NEC, 
while randomized, controlled trials in preterm infants sug- 
gest that probiotic agents may reduce the risk of NEC 
[7,8,14]. The immature enterocytes of preterm infants ex- 
hibit excessive signaling in the TLR4 pathway in response 
to the lipopolysaccharide (LPS) presented by Gram- 
negative bacteria, and this interaction has often been im- 
plicated in NEC onset [12,13,15]. Various other agents 
have also been associated with NEC onset, including bac- 
terial, viral, and fungal pathogens [9-11,16-18]. Despite 
this, no specific microbial pattern has been consistently 
identified with NEC onset. 

Early colonizing organisms interact with the intestinal 
mucosa to shape the developing immune system towards 
homeostasis or dysregulation [19-24], and could thus 
contribute to the pathobiology leading to onset of NEC. 
However, early colonization has been largely overlooked as 
a focus of study. To address this gap, we examined the 
early microbial community to identify predictive microbial 
biomarkers of later NEC in a prospective study of preterm 
infants. Culture-independent 16S rRNA gene sequencing 
of stool samples from 4 to 16 days of life was utilized to 
identify microbial community signatures. Because intes- 
tinal bacteria can influence the metabolic profiles of their 
hosts [25-29], we pursued urinary metabolomics to identify 
surrogate biomarkers of NEC. 

Methods 

Preterm infant cohort 

All infants in this study were part of an ongoing larger 
study of the preterm microbiome and determinants of
NEC, which was conducted in two level III neonatal intensive
care units (NICUs) in Cincinnati, Ohio. Infants
were enrolled between October 2009 and August 
2010. Enrollment criteria included being free of congenital 
anomalies and survival free of NEC in the first week of 
postnatal life. After Institutional Review Board approval 
and parental informed consent, standardized clinical data 
were collected following the National Institute of Child 
Health and Human Development (NICHD) Neonatal 
Research Network protocol until discharge from hospital. 
All cases were reviewed by a senior neonatologist (KRS). 

A total of 35 preterm infants <29 weeks gestational 
age and < 1,200 g birth weight were included in this study. 
The primary analysis included 11 infants who developed 
NEC and 21 control infants. As a secondary comparator, 
we included three non-NEC deaths attributed to respiratory 
distress syndrome or suspected infection. Non-NEC deaths 
were included as they represent a competing outcome with 
NEC during hospitalization in the NICU, and clinical stud- 
ies of probiotics and antibiotics report similar associations 
with NEC and death [7,8,14], suggesting the possibility that 
they may be similarly related to the intestinal microbiome. 
Controls were consecutively enrolled infants from the same 
NICUs who survived free of NEC or sepsis. NEC was 
defined as modified Bell's stage II or III [30].

The infant microbiome was analyzed by postnatal time 
periods, days 4 to 9 and 10 to 16. These periods were 
selected because the extremely premature infants in this 
study did not stool consistently and did not stool prior to 
day 4 of life. Each period included two planned sample col- 
lections, increasing the chances of having a sample available 
from each infant during the interval. Stools were collected 
from soiled diapers; urine samples were collected from the 
infant diaper using cotton balls. Upon collection, samples 
were immediately refrigerated in the NICU and transported 
to the study laboratory daily. Stool was scraped into a tube 
containing thioglycollate, and frozen at -80°C until the 
DNA could be extracted. Upon removal from the cotton 
ball, urine samples were frozen at -80°C without an addi- 
tive until metabolomic analysis was performed. All sample 
collection and storage utilized preprinted, bar-coded labels. 

In the few instances when two stool samples were 
available per infant in a collection period, statistical 
independence was ensured by including only the earlier 
sample. Further, only samples collected prior to diagnosis 
of NEC were included. These criteria resulted in inclusion 
of 58 stool samples for analysis, 18 from the 11 NEC 
cases, 37 from the 21 control infants, and 3 from the 3 
non-NEC deaths. Urine samples were collected during 
days 4 to 9, and all available samples collected prior to 
NEC onset in that time period were analyzed by NMR 
[31]. A total of 60 urine samples, 18 from 11 NEC cases, 
36 from 20 controls, and 6 from non-NEC deaths, were 
included in initial analyses. Final analysis of urinary 
biomarkers was restricted to the first urine sample collected
in the day 4 to 9 period. 
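
The inclusion rules above (earliest sample per infant per collection period, and only samples collected before NEC diagnosis) can be sketched in a few lines. This is an illustrative sketch only: the tuple layout, the function name `select_stool_samples`, and the reading of "prior to diagnosis" as a strictly earlier postnatal day are assumptions, not details taken from the study.

```python
# Collection windows (postnatal days, inclusive) taken from the text.
PERIODS = {"days_4_9": (4, 9), "days_10_16": (10, 16)}


def select_stool_samples(samples, nec_onset_day):
    """Keep the earliest pre-onset sample per infant and collection period.

    samples: list of (infant_id, postnatal_day) tuples (hypothetical layout).
    nec_onset_day: dict infant_id -> day of NEC diagnosis; infants without
        NEC are simply absent from the dict.
    Returns a dict (infant_id, period_name) -> postnatal day of the sample kept.
    """
    earliest = {}
    for infant, day in samples:
        onset = nec_onset_day.get(infant)
        if onset is not None and day >= onset:
            continue  # assumed rule: only samples before the diagnosis day
        for name, (start, end) in PERIODS.items():
            if start <= day <= end:
                key = (infant, name)
                if key not in earliest or day < earliest[key]:
                    earliest[key] = day
    return earliest
```

Keeping one sample per infant per period is what preserves statistical independence within each period-level comparison.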

DNA extraction 

Bacterial DNA was extracted from infant stool samples 
using one of two methods: phenol-chloroform or the 
QiaAmp DNA stool kit (Qiagen Sciences, Germantown, 
MD). For the phenol-chloroform method, samples were 
thawed, centrifuged and supernatant removed. From the 
sample, 0.2 g of stool was transferred to a 2 mL screw-cap 
tube containing 0.3 g of 1 mm zirconia beads. Next, 1 mL 
Tris-EDTA (TE) buffer was added and the sample 
resuspended. Then, 150 µL of buffer-saturated phenol was
added and the sample was bead beaten for 3 minutes at 4°C. The
sample was then allowed to sit for 1 minute at 4°C before
extraction with 150 µL chloroform:isoamyl alcohol (24:1).
Samples were centrifuged for 5 minutes at 15,700 × g at 4°C
and the resulting aqueous layer was transferred to a clean
tube and again extracted with an additional 150 µL phenol
and 150 µL chloroform:isoamyl alcohol. Samples were then
centrifuged for 2 minutes at 15,700 × g at 4°C. The aqueous
layer was subjected to two additional chloroform:isoamyl
alcohol extractions. The resulting aqueous layer was transferred
to a clean microfuge tube a final time and 4 µL of
5 mg/mL glycogen was added and mixed. This was
followed by the addition of 1/10 volume of 3 molar sodium
acetate and 2 volumes of 70% cold (-20°C) ethanol. The
samples were mixed and then centrifuged at 15,700 × g for 5
minutes at 4°C. The supernatant was discarded and
the pellet resuspended in 100 µL TE buffer. For the
extractions using the QiaAmp DNA stool kit, samples 
were thawed and 0.2 g of stool was transferred to a 
bead beating tube containing 0.3 g of 0.1 mm glass 
beads, 1.4 mL of buffer ASL was added, and bead 
beating was conducted for 3 minutes on the homogenize 
setting. The suspension was then heated at 70°C for 5 minutes,
and the manufacturer's instructions were followed.

16S rRNA gene analysis 

The V3 to V5 window of the 16S rRNA gene was amplified 
and sequenced using 454 FLX Titanium sequencing as 
already described in full detail [32]. A total of 1.3 million 
resulting sequences were processed using a data curation 
pipeline implemented in mothur [33], complemented by 
UCHIME for chimera detection [34]. This processing has 
been detailed previously [35], with the modification that 
AbundantOTU was replaced by Newbler for assembly- 
based error reduction [36] and followed by mothur's 
implementation for operational taxonomic unit (OTU) 
clustering (parameters: method = average, cutoff = 0.03, 
precision = 1,000). The mean read count per sample was 
4,989. Representative sequences per OTU were classified 
with the MSU RDP classifier v2.2 [37] using the 
Greengenes taxonomy [38]. The 16S rRNA gene sequences
obtained from the study samples were assigned to a total of
411 distinct OTUs. OTUs were then assigned phylogenetic 
classifications, typically to the genus level. 

Metabolomic analysis 

Urine samples were thawed on ice immediately prior to 
preparation for NMR analysis. A 1 mL aliquot of each sample
was centrifuged for 10 minutes at 2,655 × g, 350 µL of
clarified urine was pipetted into a 1.5 mL microcentrifuge tube,
and 350 µL of buffer was added to each sample. Next, 600 µL of
the urine/buffer mixture was pipetted into a 5-mm NMR 
tube. All NMR experiments were carried out on a Bruker 
Avance™ III spectrometer operating at 600 MHz 
frequency equipped with a room-temperature 5-mm 
triple-resonance probe with inverse detection and 
controlled by TopSpin 3.0 (Bruker Biospin, Germany). 
Experiments were conducted at 298 K. Data were collected 
using a spectral width of 20.0 ppm. Three NMR experiments,
optimized by Bruker (Bruker BioSpin, Germany) for
use in metabonomic studies, were run on all samples: a
standard one-dimensional (1D) presaturation (zgpr), a 1D
first increment of a NOESY (noesygppr1d), and a CPMG
(cpmgpr1d); however, only the CPMG, which produced
superior baselines for analysis, was used for metabolomic
analysis. The transmitter offset frequency (O1) was adjusted
to obtain optimal water suppression. The 90° pulse widths,
determined for every sample using the automatic pulse
calculation feature in TopSpin, were between 10 and 12 µs.
Water suppression in all experiments was achieved by
irradiation of the water peak during the recycle delay. Next,
1D zgpr NMR spectra were collected to assess the shim
quality, which was considered acceptable when the line
width was <1 Hz and the line shape enabled detection of
resolved 13C satellites for the TSP internal standard. The
CPMG experiment used 64 transients with 4 dummy scans, 
46,280 points per spectrum giving an acquisition time of 
1.87 s, a T2 filter loop of 128 with an echo time of 1 ms, 
apodized using -0.01 Hz of exponential line broaden- 
ing, and a 4-s recycle delay. All NMR spectra were 
phased, baseline corrected, and subjected to chemical 
shift registration relative to TSP in TopSpin 3.0 
(Bruker BioSpin, Germany). 

Statistical methods 

Differences in clinical characteristics of cases and controls 
were tested using Fisher's exact test for categorical variables
and analysis of variance, t-tests or Kruskal-Wallis for continuous
variables, as appropriate. The Kruskal-Wallis test
was used to compare the relative abundance of distinct 
taxonomic units. Data were analyzed using R [39], linear 
discriminant analysis effect size (LEfSe) [40], QIIME 
[41,42], and SAS [43]. LEfSe was used to identify the phylo- 
genetic features that differed significantly between all NEC 
cases and controls and later between NEC sub-types and 
controls; this program uses Kruskal-Wallis tests to identify
differences in abundance between groups at the alpha level 
of 0.05 [40]. 
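
To make the Kruskal-Wallis comparison concrete, the following is a minimal sketch for the two-group case (for example, NEC versus control relative abundances of one taxon). It is not the LEfSe, R, or SAS implementation used in the study; the function names are hypothetical, and the p-value relies on the usual chi-square approximation with 1 degree of freedom, which may be rough for very small groups.

```python
import math


def average_ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kruskal_wallis_two_groups(a, b):
    """Tie-corrected Kruskal-Wallis H and approximate p-value for two groups."""
    pooled = list(a) + list(b)
    n = len(pooled)
    ranks = average_ranks(pooled)
    sum_a = sum(ranks[:len(a)])
    sum_b = sum(ranks[len(a):])
    h = 12 / (n * (n + 1)) * (sum_a ** 2 / len(a) + sum_b ** 2 / len(b))
    h -= 3 * (n + 1)
    # Tie correction: divide by 1 - sum(t^3 - t) / (n^3 - n) over tie groups.
    counts = {}
    for v in pooled:
        counts[v] = counts.get(v, 0) + 1
    correction = 1 - sum(t ** 3 - t for t in counts.values()) / (n ** 3 - n)
    if correction == 0:
        raise ValueError("all values are identical; H is undefined")
    h = max(h / correction, 0.0)
    # With two groups, H is referred to chi-square with 1 degree of freedom,
    # whose upper tail equals erfc(sqrt(H / 2)).
    return h, math.erfc(math.sqrt(h / 2))
```

Because the test uses ranks only, it makes no normality assumption, which suits sparse, skewed relative-abundance data.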

Microbial community analysis 

We first examined alpha diversity [44,45], which was analyzed
using the Simpson diversity index and Chao1 richness
index. Samples collected prior to NEC onset were
compared to controls without elimination of rare
reads but after rarefying (standardizing) to 2,000 sequence
reads per sample. 
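
As a rough illustration of these alpha diversity steps, the sketch below rarefies one sample's OTU counts to a fixed depth and then computes Chao1 and Simpson values. The text does not state which variants were used, so the bias-corrected Chao1 form and the 1 - sum(p^2) form of the Simpson index are assumptions, as are the function names.

```python
import random


def rarefy(counts, depth, seed=0):
    """Subsample OTU counts without replacement to a fixed read depth."""
    reads = [otu for otu, n in enumerate(counts) for _ in range(n)]
    if len(reads) < depth:
        raise ValueError("sample has fewer reads than the rarefaction depth")
    picked = random.Random(seed).sample(reads, depth)
    out = [0] * len(counts)
    for otu in picked:
        out[otu] += 1
    return out


def chao1(counts):
    """Bias-corrected Chao1 richness: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1))."""
    observed = sum(1 for n in counts if n > 0)
    singletons = sum(1 for n in counts if n == 1)
    doubletons = sum(1 for n in counts if n == 2)
    return observed + singletons * (singletons - 1) / (2 * (doubletons + 1))


def simpson(counts):
    """Simpson diversity expressed as 1 - sum(p_i^2)."""
    total = sum(counts)
    return 1 - sum((n / total) ** 2 for n in counts)
```

A sample would be processed as `chao1(rarefy(counts, 2000))`. Chao1 depends on singleton and doubleton OTUs, which is why rare reads are retained for this analysis.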

To examine the beta diversity of microbial communities, 
we used non-metric multidimensional scaling (NMDS) [46,47],
undertaken in R, to ordinate the microbial communities of 
samples based on the weighted UniFrac distance calculated 
by QIIME. To improve signal to noise and reduce random 
error, we excluded rare OTUs from the beta diversity ana- 
lyses. Rare OTUs were defined as those detected in only 
one sample or with less than five sequences in the overall 
dataset [10]. This resulted in 99 distinct OTUs that were 
included for analysis. 
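
The rare-OTU rule can be written as a simple filter. The sketch below assumes a hypothetical table layout (a dict mapping each OTU identifier to its per-sample read counts) and a hypothetical function name; only the two exclusion criteria come from the text.

```python
def drop_rare_otus(table):
    """Remove OTUs seen in only one sample or with fewer than 5 reads overall.

    table: dict otu_id -> list of per-sample read counts (hypothetical layout).
    """
    kept = {}
    for otu, counts in table.items():
        n_samples = sum(1 for c in counts if c > 0)
        total_reads = sum(counts)
        if n_samples > 1 and total_reads >= 5:
            kept[otu] = counts
    return kept
```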

Ordination using NMDS was undertaken with several 
random starts to avoid entrapment in local optima. The 
results were centered and scaled, so that the variance 
was maximized along the first NMDS axis. This method of 
ordination was chosen because NMDS measures the close- 
ness of fit (stress) based on ranking of the dissimilarity of 
values, with no assumption of multivariate normality, and 
is a powerful and flexible method that handles sparse, non- 
parametric data well. Use of the weighted UniFrac distance 
metric was selected, as it accounts for the relative abun- 
dance and relatedness of taxa, rather than merely their 
presence or absence. Selection criteria for the number of 
dimensions in NMDS analysis were based on the protocol 
used in PC-ORD 6 software [47]. For each ordination, we 
then ran a cluster analysis on the UniFrac distance matrix 
using the Ward minimum variance method [48] to objectively
identify clusters of samples with similar microbial
composition. In this method, the distance between two 
clusters is the analysis of variance sum of squares between 
the clusters summed over all the variables. At each step, 
the within-cluster sum of squares is minimized over all 
partitions obtainable by merging clusters from the previous 
generation in order to maximize the likelihood at each level 
of the hierarchy. The number of clusters is determined by 
examining the scree plot resulting from the pseudo F and T 
plots. We used the SAS PROC Cluster procedure to 
conduct the clustering analysis. 
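
For readers without SAS, the agglomeration step can be sketched with the standard Lance-Williams update for Ward's criterion. This is an assumption-laden illustration rather than a reproduction of PROC CLUSTER: the number of clusters is passed in directly instead of being read from pseudo F and T plots, the function name is hypothetical, and applying Ward's update to a non-Euclidean distance such as UniFrac is a heuristic.

```python
def ward_clusters(dist, n_clusters):
    """Agglomerate samples with Ward's criterion via Lance-Williams updates.

    dist: symmetric matrix (list of lists) of pairwise distances.
    n_clusters: number of clusters to stop at (at least 1).
    Returns a sorted list of clusters, each a sorted list of sample indices.
    """
    n = len(dist)
    # Ward's update operates on squared distances; keys are (smaller, larger).
    d2 = {(i, j): dist[i][j] ** 2 for i in range(n) for j in range(i + 1, n)}
    members = {i: [i] for i in range(n)}
    next_id = n
    while len(members) > n_clusters:
        a, b = min(d2, key=d2.get)  # closest pair of clusters
        dab = d2.pop((a, b))
        na, nb = len(members[a]), len(members[b])
        for k in members:
            if k in (a, b):
                continue
            dak = d2.pop((min(a, k), max(a, k)))
            dbk = d2.pop((min(b, k), max(b, k)))
            nk = len(members[k])
            # Lance-Williams formula for the minimum variance method.
            d2[(k, next_id)] = (
                (na + nk) * dak + (nb + nk) * dbk - nk * dab
            ) / (na + nb + nk)
        members[next_id] = members.pop(a) + members.pop(b)
        next_id += 1
    return sorted(sorted(m) for m in members.values())
```

Each merge is the one that increases the within-cluster sum of squares the least, which tends to produce compact clusters of similar size.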

Metabolomics allows identification of distinct patterns 
of small molecules generated during both host and micro- 
bial cellular metabolism [25-29]. Urinary metabolomics 
was thus undertaken in search of surrogate biomarkers of 
dysbiosis and additional clues regarding the microbially 
distinct NEC cases in relation to controls and to each
other. Urinary metabolite data were analyzed using principal
component analysis (PCA) as implemented in AMIX
(Bruker Biospin, Billerica, MA). NMR spectra were pre- 
pared for PCA using manual binning to avoid splitting of 
peaks. Individual metabolites were compared between NEC 
cases and controls using a t-test as implemented by PROC
MULTTEST in the SAS 9.2 software (SAS Institute, Cary, 
NC). The Benjamini and Hochberg procedure [49] was 
used to control for multiple comparisons, using an adjusted 
P value of <0.05 [49]. After identification of significant 
metabolites, generalized estimating equations were applied 
to adjust for multiple samples per subject and to control 
for potential confounding factors. Only metabolites that 
were significant at <0.05 in both analyses were considered 
significant. A total of eight samples analyzed by NMR were 
excluded based on poor sample quality or as outliers based 
on the Hotelling T2 method [50]. 
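
The Benjamini and Hochberg adjustment used for the metabolite comparisons can be sketched as follows. This is a generic step-up implementation with a hypothetical function name, not the PROC MULTTEST code itself; a metabolite would be retained when its adjusted value is below 0.05.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```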

Results 

Preterm infant cohort 

The characteristics of NEC cases versus controls are shown 
in Table 1. Controls were generally well matched to NEC 
cases on clinical factors, and did not differ in regard to birth 
weight (median 850 g), gestational age (median 26 weeks), 
race (37% black), gender (51% female), mode of delivery 
(66% Cesarean section), maternal antibiotic use at the time 
of delivery (51%), or infant antibiotic use for >5 days in the 
first week (34%). All study infants were fed the mother's
own milk or human donor milk; the timing and degree of 
feeding was not significant in relation to NEC. Only
primiparity differed between NEC cases (64%) and controls
(24%, P = 0.05), but was not significantly associated with
microbial composition. Of the 11 NEC cases, 8 were Bell's
stage II and 3 were Bell's stage III (surgically treated). Four
of the NEC cases died, and three of the NEC cases developed
sepsis (two Klebsiella isolates, one coagulase-negative
Staphylococcus isolate). The three non-NEC deaths included
for comparison were white, male, and younger (24 and
25 weeks gestational age) and smaller (<850 g birth weight)
than the other infants, but otherwise unremarkable. For access
to the metadata, please see Additional file 1.

Table 1 Characteristics of study infants

| Variable, description | NEC (n = 11) | Controls (n = 21) |
|---|---|---|
| Birth weight, mean (SD) (g) | 791 (212) | 839 (187) |
| Gestational age, mean (SD) (weeks) | 25.5 (1.8) | 25.9 (1.5) |
| Non-Hispanic black, number (%) | 5 (45.5) | 8 (38.1) |
| Male, number (%) | 6 (54.6) | 8 (38.1) |
| Patent ductus arteriosus (PDA), number (%) | 7 (63.6) | 12 (57.1) |
| Primiparous, number (%) | 7 (63.6)* | 5 (23.8) |
| Prenatal steroids given (any), number (%) | 10 (90.9) | 19 (90.5) |
| Hypertension/eclampsia, number (%) | 2 (18.2) | 7 (33.3) |
| Antepartum hemorrhage, number (%) | 1 (9.1) | 5 (23.8) |
| Multiple births, number (%) | 1 (9.1) | 7 (33.3) |
| Antepartum antibiotic therapy, number (%) | 6 (54.6) | 10 (47.6) |
| Cesarean section delivery, number (%) | 7 (63.6) | 14 (66.7) |
| Chorioamnionitis, number (%) | 1 (9.1) | 1 (4.8) |
| Antibiotic use in the first week for >5 d, number (%) | 4 (36.4) | 6 (28.6) |

*NEC vs controls, primiparity P = 0.05.
NEC: necrotizing enterocolitis.

Dominant organisms 

Consistent with previous studies in preterm infants 
[12,51-53], the dominant phyla were Proteobacteria and 
Firmicutes, with a minor contribution (1% to 2%) from 
Bacteroidetes and Actinobacteria. The most common 
genera were, in order of relative abundance: Enterobacter,
Staphylococcus, Escherichia, Enterococcus, Leuconostoc,
Lactococcus, Streptococcus, and Clostridia. The first four
genera accounted for more than 90% of the microbial 
sequence reads. Pseudomonas, another Proteobacteria often 
associated with NEC or sepsis in preterm infants, occurred 
in 31% of samples, while together Lactobacillaceae and 
Bifidobacteriaceae, beneficial Gram-positive organisms (of 
the phyla Firmicutes and Actinobacteria, respectively), were 
present in only 19% of samples. 

At the level of phyla, controls had a median relative 
abundance of approximately 80% Proteobacteria (Gram- 
negative organisms) and 20% Firmicutes (Gram-positive or- 
ganisms), with a small proportion of Bacteroidetes and 
Actinobacteria; this pattern was remarkably stable over the 
first few weeks of life. Furthermore, most of the sequences 
contributing to the relative abundance of these large phyla 
came from only a few host-associated genera, Enterococcus 
and Staphylococcus for Firmicutes and Enterobacter and 
Escherichia for Proteobacteria. In infants who later 
developed NEC, microbial community composition differed 
sharply with the median relative abundance of Proteo- 
bacteria less than 40% and Firmicutes approximately 60% 
during postnatal days 4 to 9. A dramatic difference was 
then observed in samples from NEC cases such that over 
90% of the microbial communities were composed of 
Proteobacteria, while less than 10% were composed of 
Firmicutes (Figure 1). In the three non-NEC deaths (data 
not shown), Firmicutes strongly dominated, accounting for 
70% to 90% of microbial communities, with most of the 
remaining community composition being Proteobacteria. 

Systematic comparison of all NEC samples and all 
control samples collected between postnatal days 4 to 16 
found no significant differences in microbial compos- 
ition. NEC and control samples were then compared by 
week, and the only significant difference occurred in the 
relative abundance of Propionibacterium, a genus of the 
phylum Actinobacteria. Propionibacterium includes both
skin- and intestinal-tract-colonizing organisms; members
have demonstrated probiotic as well as pathogenic potential 
[54,55]. During days 4 to 9, Propionibacterium was identi- 
fied in samples from 10 (56%) of the 18 control infants, but 
none of the 9 infants who later developed NEC (P = 0.01,
Figure 2A). No difference was found in the relative abun- 
dance of Propionibacterium between NEC and control 
samples collected from days 10 to 16 of life. 
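
The presence/absence comparison for Propionibacterium during days 4 to 9 can be checked with a small two-sided Fisher's exact test. The sketch below uses the counts given in the text (10 of 18 controls positive, 0 of 9 NEC cases positive); the function name is hypothetical, and the two-sided rule of summing all tables no more probable than the observed one is an assumption about how the reported P value was computed.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    observed = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more likely than the observed one.
    return sum(p for p in (prob(x) for x in range(lo, hi + 1))
               if p <= observed * (1 + 1e-9))


# Rows: controls, NEC; columns: Propionibacterium detected, not detected.
p_value = fisher_exact_two_sided(10, 8, 0, 9)
```

Under these counts the sketch gives a value of roughly 0.009, in line with the P value reported for this comparison.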

Alpha diversity 

During postnatal days 4 to 9, infants who later devel- 
oped NEC tended to have samples with lower alpha 
diversity than controls as measured by the Chao1
index (median 9.2 for NEC, 18.4 for control samples;
Kruskal-Wallis, P = 0.086; Figure 2B) with a similar trend
using the Simpson index (P = 0.221, data not shown).
After day 9 of life, NEC samples continued to trend 
towards lower median alpha diversity than controls, but 
no significant differences were found by either index. 

Dysbiosis-identified sub-types of NEC 

For all study infants, the relative similarity of microbial 
communities between samples (beta diversity) was then 
examined by calculating their weighted UniFrac distances, 
and visualized using non-metric multi-dimensional scaling 
ordination (NMDS). Based on ordination of day 4 to 9 
samples (Figure 3A), application of the Ward minimum
variance method identified four clusters, designated I through IV,
indicating microbial community similarity. Of these, only 
Clusters I and II included NEC cases. Cluster I consisted of 
samples from four NEC cases (NEC-I), two non-NEC 
deaths, and two controls (Figure 3B), and was characterized 
by dominance of organisms of the phylum Firmicutes, class 
Bacilli. Cluster II samples consisted of the remaining 5 
NEC cases (NEC-II) as well as 12 of the 18 controls, and 
were characterized by dominance of organisms of the 
phylum Proteobacteria, family Enterobacteriaceae. 

Within Cluster I, the genera Enterococcus and Staphylo- 
coccus, taxa representing different orders of Bacilli, 
accounted for 98% or more of the microbial community 
of samples from NEC-I infants. In the two non-NEC 
deaths found in Cluster I, the same taxa constituted 80% 
and 95% of their microbial communities, and in the two 
control samples, 62% and 73%. Comparing the NEC cases 
between clusters, the relative abundance of Firmicutes 
(specifically, Bacilli) was significantly higher in samples
from NEC-I than NEC-II cases (median 99.3%, NEC-I versus
17%, NEC-II, P = 0.014). Further, NEC-I samples had
a significantly greater relative abundance of Firmicutes,
specifically, Bacilli (P = 0.001), compared to all controls.
Conversely, the relative abundance of Proteobacteria, fam- 
ily Enterobacteriaceae, was significantly higher in samples 
from NEC-II than NEC-I cases (median 83%, NEC-II 
versus 0.4%, NEC-I, P = 0.014), but NEC-II samples did
not differ significantly in microbial composition compared
to all controls during days 4 to 9.

Figure 1 The relative abundance of bacterial phyla in infants who developed NEC vs control infants. Columns represent samples from
days of life 4 to 9 and 10 to 16. Data are graphed as box plots. The middle bar represents the median, the outer horizontal lines of the box
represent the 25th and 75th percentiles. Dots above or below each box indicate outliers. NEC: necrotizing enterocolitis.

Figure 2 Microbial community differences between NEC and control infants, days 4 to 9 samples. (A) This box plot indicates the relative
abundance of the genus Propionibacterium in 18 control samples and 9 samples prior to NEC onset during days 4 to 9 of life. None of the samples from infants
who later developed NEC, but 10 (56%) of the control samples, had detectable amounts of Propionibacterium (using Fisher's exact test for the
presence or absence in NEC vs control samples, P = 0.009). (B) This box plot indicates the Chao1 richness index in samples from days 4 to 9 of
life. NEC cases tended towards lower diversity than controls, but the comparison was not significant (Kruskal-Wallis, P = 0.086). For each box plot,
the middle bar represents the median, the outer horizontal lines of the box represent the 25th and 75th percentiles. The dots overlaying the
plots indicate the values of individual samples. Dots are distributed horizontally in order to prevent overlapping. NEC: necrotizing enterocolitis.

The ordination of samples from days 10 to 16 (Figure 4A)
and application of the Ward minimum variance method
identified three microbial community clusters (A, B and D)
and an outlier (C). The NEC-I infants that had tightly
clustered in samples from the first 4 to 9 days were
dispersed across Clusters A, B and D in this ordination,
with no discernible similarity. In contrast, consistent with
the ordination for days 4 to 9, all of the NEC-II cases were
found in a single cluster (Cluster A) during days 10 to 16.
Cluster A also included one of the dispersed NEC-I cases,
the composition of which was 83% Enterobacter. Cluster A
thus included 7 (78%) of the 9 NEC cases and 12 (63%) of
19 controls, and was characterized by preponderance of
Proteobacteria, specifically, the family Enterobacteriaceae
(Figure 4B). Compared to all control samples during days
10 to 16, the six NEC-II/Cluster A cases had a significantly
elevated relative abundance of Proteobacteria (specifically,
Enterobacteriaceae, P = 0.010). All six NEC-II/Cluster A infants
versus 8 (42%) of 19 control samples were composed
of 90% or more Proteobacteria (P = 0.020). We systematically
compared the composition of samples, and found, as in
the prior week, that during days 10 to 16, NEC-I samples
had a significantly higher relative abundance of Firmicutes
than NEC-II samples, specifically, Bacilli (median 33%,
NEC-I versus 0.3%, NEC-II, P = 0.020). However, the relative
abundances of these taxa in NEC-I cases during days
10 to 16 (median 33% Firmicutes or Bacilli) were not as
extreme as that observed earlier (median >98% Firmicutes
or Bacilli during days 4 to 9). In contrast, NEC-II cases
became even more dominated by Proteobacteria, specifically,
the Enterobacteriaceae (median 99.6%, NEC-II versus
38%, NEC-I, P = 0.020). Furthermore, during days 10 to
16, NEC-II samples also had a significantly greater relative
abundance of Proteobacteria compared to all controls
(median 99.6%, NEC-II versus 84%, controls, P = 0.01).

[Figure 4 image, days 10-16. Panel A: NMDS ordination (horizontal axis NMDS Axis 1) with points for NEC-I, NEC-II, and controls. Panel B: relative abundance bars grouped under Cluster A, Clusters B & C, and Cluster D for NEC-I, NEC-II, controls, and non-NEC deaths; genera in the legend are Leuconostoc, Weissella, Enterococcus, Acinetobacter, Escherichia, Clostridiaceae 1, Staphylococcus, Streptococcus, Enterobacter, and Lactococcus.]

Figure 4 NMDS ordination of microbial communities for days 10 to 16 of life. (A) This ordination was based on weighted UniFrac beta
diversity and run with three dimensions in the vegan package of R, resulting in a stress of 2.43. Controls are shown in black and NEC and
non-NEC deaths in either red (cases identified as NEC-I in the ordination for days 4 to 9) or green (cases identified as NEC-II in the ordination for
days 4 to 9). Clusters identified using the Ward minimum variance method are indicated in this ordination as A, B and D; C is identified as an
outlier value. (B) Bars indicate the relative abundance of the ten most common genera in samples from individual infants, whose study numbers
are noted on the horizontal axis. NEC sub-types (NEC-I and NEC-II) correspond to the NEC cases included in Cluster I and Cluster II, respectively, in
the ordination of samples for days 4 to 9 (Figure 3). The clusters identified in this ordination are indicated by column headers. Clusters indicate
microbial community similarity. NEC: necrotizing enterocolitis; NMDS: non-metric multidimensional scaling.

Morrow et al. Microbiome 2013, 1:13
http://www.microbiomejournal.com/content/1/1/13

In summary, four NEC cases were classified as NEC-I, 
all of whom were in Cluster I, uniquely characterized by
>98% relative abundance of Firmicutes, class Bacilli, during
postnatal days 4 to 9. Six NEC cases were classified as 
NEC-II, all of whom were in Cluster A during days 10 to 
16 as well as Cluster II during days 4 to 9. These NEC-II 
cases were all composed of >90% Proteobacteria, family 
Enterobacteriaceae, in samples from postnatal days 10 to 
16. One NEC infant (subject 16) lacked a sample from days 
10 to 16 and could not be formally classified, but followed 
the pattern of NEC-II based on their sample for days 4 to 9, 
which was dominated by Escherichia and is within Cluster 
II. All three non-NEC deaths were characterized by early 
Firmicutes dysbiosis, as for NEC-I. One of these non-NEC 
deaths (subject 40) lacked a sample from days 4 to 9, but 
was considered to be a high Firmicutes dysbiosis based on 
their sample for days 10 to 16, which was predominantly 
composed of Staphylococcus. The primary ordinations of 
beta diversity (Figures 3 and 4) included all samples and all 
non-rare sequence reads, but ordination using rarefied 
samples (Additional file 2: Figure S1) was also conducted 
and had the same pattern as that shown in Figures 3 and 4. 
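
The two dysbiosis definitions summarized above reduce to simple threshold rules on relative abundance. The following is a minimal sketch of those rules only, not of the ordination and Ward clustering that actually defined the clusters; the function name, argument names and return labels are hypothetical, and the example abundances are illustrative rather than study data.

```python
from typing import Optional

FIRMICUTES_CUTOFF = 0.98      # >98% relative abundance, days 4 to 9 (threshold from the text)
PROTEOBACTERIA_CUTOFF = 0.90  # >90% relative abundance, days 10 to 16 (threshold from the text)


def classify_dysbiosis(firmicutes_d4_9: Optional[float],
                       proteobacteria_d10_16: Optional[float]) -> str:
    """Label a subject by the threshold rules described in the text.

    Inputs are relative abundances in [0, 1]; None means no sample was
    available in that window. The labels are hypothetical names.
    """
    if firmicutes_d4_9 is not None and firmicutes_d4_9 > FIRMICUTES_CUTOFF:
        return "Firmicutes"
    if proteobacteria_d10_16 is not None and proteobacteria_d10_16 > PROTEOBACTERIA_CUTOFF:
        return "Proteobacteria"
    if firmicutes_d4_9 is None or proteobacteria_d10_16 is None:
        # A missing window means neither rule can be fully evaluated.
        return "unclassified"
    return "none"


# Illustrative values only (not taken from the study data).
examples = {
    "early Firmicutes-dominated": classify_dysbiosis(0.995, 0.33),
    "later Proteobacteria-dominated": classify_dysbiosis(0.10, 0.996),
    "neither": classify_dysbiosis(0.17, 0.84),
    "missing second window": classify_dysbiosis(0.05, None),
}
```

A subject with a missing window is left unclassified here, which mirrors the handling of the infant who lacked a sample from days 10 to 16; the study assigned that infant by cluster membership instead, which this sketch does not attempt.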

The alpha diversity of the microbial communities was 
then reanalyzed in relation to these NEC sub-types, as 
before using the full set of OTUs without eliminating rare 
sequence reads but rarefying samples to 2,000 sequence 
reads per sample. No significant differences were found in 
NEC sub-types by either the Simpson or Chao1 indices. 
We then compared the clinical characteristics of the NEC-I 
and NEC-II infants. NEC-I onset occurred between postna- 
tal days 7 and 21, while NEC-II onset occurred between 
postnatal days 19 and 39 (P = 0.086, Kruskal-Wallis test). 
It is noteworthy that the non-NEC deaths, which clustered 
with NEC-I samples in the ordination of samples from days 
4 to 9, had a similarly high Firmicutes dysbiosis in the first 
week of life, and that these deaths occurred between days 9 
and 17, the same postnatal period during which NEC-I cases 
occurred. We then compared the two NEC sub-types in 
relation to each variable listed in Table 1, as shown in 
Additional file 2: Table S1. The only statistical difference in 
the two NEC sub-types was in the administration of antibi- 
otics to the mother at the time of delivery. No NEC-I 
infants, but 6 (86%) of the 7 NEC-II infants, were born of 
mothers who were given antibiotics at the time of delivery 
(P = 0.015). While different delivery modes did not readily 



explain the association between NEC-II and maternal anti- 
biotic use, we examined the beta diversity of the microbial 
communities in relation to delivery mode and NEC or con- 
trol status, and observed a tendency for NEC-I to cluster 
with C-section delivery, and NEC-II to cluster with vaginal 
delivery (Additional file 2: Figure S2). For two other clinical 
factors, patent ductus arteriosus (PDA, a heart problem of 
preterm infants, which has been linked to later-onset NEC) 
and primiparity, we observed a trend (P = 0.089) towards a 
different distribution in NEC-I and NEC-II cases. 
Each factor was independently found in only 1 (25%) 
of 4 NEC-I infants versus 6 (86%) of 7 NEC-II infants. No 
other clinical characteristics appeared to differ between 
the NEC sub-types, nor in comparison to controls. 
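
The alpha diversity comparison described above rests on rarefying each sample to a fixed depth (2,000 reads in this study) and then computing the Simpson and Chao1 indices. The sketch below shows one common way to compute them from a table of OTU read counts. It assumes the Gini-Simpson form (1 minus the sum of squared proportions) and the bias-corrected Chao1 estimator; the text does not state which variants were used, and the function names and example counts are hypothetical.

```python
import random
from collections import Counter


def rarefy(counts, depth, seed=0):
    """Subsample `depth` reads without replacement from a dict of OTU -> read count."""
    pool = [otu for otu, n in counts.items() for _ in range(n)]
    if len(pool) < depth:
        raise ValueError("sample has fewer reads than the rarefaction depth")
    rng = random.Random(seed)
    return dict(Counter(rng.sample(pool, depth)))


def simpson(counts):
    """Gini-Simpson index: 1 - sum(p_i^2) over OTU proportions p_i (assumed variant)."""
    total = sum(counts.values())
    return 1.0 - sum((n / total) ** 2 for n in counts.values())


def chao1(counts):
    """Bias-corrected Chao1 richness: S_obs + F1*(F1-1) / (2*(F2+1)) (assumed variant).

    F1 and F2 are the numbers of OTUs seen exactly once and exactly twice.
    """
    s_obs = sum(1 for n in counts.values() if n > 0)
    f1 = sum(1 for n in counts.values() if n == 1)
    f2 = sum(1 for n in counts.values() if n == 2)
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


# Hypothetical toy sample: OTU label -> read count.
toy_sample = {"otu_a": 10, "otu_b": 5, "otu_c": 2, "otu_d": 1, "otu_e": 1, "otu_f": 1}
toy_rarefied = rarefy(toy_sample, depth=10)
```

Because Chao1 depends on singleton and doubleton counts, it is sensitive to whether rare sequence reads are retained, which is why the analysis kept the full set of OTUs when computing it.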

DNA extraction 

As DNA extraction methods can affect the results of stud- 
ies of the 16S rRNA gene, we undertook a series of analyses 
to identify potential effects in our data. Examination of the 
extraction protocol in relation to beta diversity found 
no influence on identification of community clusters 
(Additional file 2: Figure S3A). Examination of the 
extraction protocol in relation to the relative abundance of 
bacteria at all taxonomic levels found taxa that differed by 
extraction method, but none of the taxa identified also 
differed significantly between NEC or NEC sub-types and 
controls (Additional file 2: Figure S3B). Examination 
of the extraction protocol in relation to alpha diversity 
found no association with the Simpson index, but did 
find an association with the Chao1 index (P = 0.01). Finally, 
we examined the extraction protocol in regression models 
of alpha diversity in relation to NEC, and found that it did 
not influence the findings. These data strengthen our 
conclusion that early differences occur in the microbiome 
of premature infants prior to onset of NEC, independent 
of extraction techniques. 

Urinary metabolomic analysis 

Principal component analysis did not demonstrate any quali- 
tative clustering in the set of urinary metabolites in relation 
to all NEC cases, NEC-I or NEC-II versus controls, nor the 
NEC sub-types in relation to each other. Individual urinary 
metabolite values were compared using t-tests corrected for 
multiple comparisons. No urinary metabolites differed sig- 
nificantly among all NEC cases and controls. However, 
three metabolites, alanine, pyridoxine (4-pyridoxate) and 
histidine, significantly distinguished NEC-I and NEC-II 
from each other as well as one of the NEC sub-types 
from controls (Figure 5 and Additional file 2: Table S2). 
Alanine was significantly (P < 0.001) higher in NEC-I versus 
NEC-II and NEC-I versus the control samples though 
the metabolite did not differ between all NEC versus con- 
trol samples (Figure 5A). Pyridoxine followed a pattern 
similar to alanine, though not as significant. The 
Figure 5 Box plots of urinary alanine and histidine in relation 
to NEC sub-types versus controls. Urine samples collected during 
days 4 to 9, restricted to one sample per infant (31 samples). 
Analysis of urinary alanine, pyridoxine and histidine among NEC-I, 
NEC-II, death and control subjects using multiple samples per 
individual (60 samples) is shown in Supplementary Information, 
Table S2 of Additional file 2. (A) Urinary alanine. Using the 
Kruskal-Wallis test, urinary alanine was significantly higher in NEC-I vs 
controls (P = 0.044) and NEC-I than NEC-II (P = 0.023), but did not 
differ between NEC-II vs controls or all NEC vs controls. (B) Urinary 
histidine. Using the Kruskal-Wallis test, urinary histidine was 
significantly lower in NEC-II vs controls (P = 0.023). Histidine also 
tended to be lower in NEC-II vs NEC-I (P = 0.059), but did not differ 
between NEC-I vs controls or all NEC vs controls. (C) Ratio of urinary 
alanine to histidine. Using the Kruskal-Wallis test, the ratio of urinary 
alanine to histidine was significantly higher in NEC cases overall vs 
controls (P = 0.023), but did not differ between the NEC sub-types. 
The optimal cut-point in the alanine:histidine ratio to differentiate 
NEC cases from controls was identified as a ratio greater than 4 
(predictive value 78%, P = 0.007). NEC: necrotizing enterocolitis. 



remaining metabolite, histidine (Figure 5B), differed in 
pattern and was significantly lower in NEC-II samples 
than controls (P = 0.01) and NEC-I samples (P < 0.001). 

Alanine, pyridoxine and histidine are commonly syn- 
thesized by bacterial enzymes, as documented by KEGG 
[56] and may be considered plausible biomarkers of bacter- 
ial dysbiosis. We analyzed the relationship of these urinary 
metabolites to microbial community characteristics in our 
dataset, using only the first collected urine sample within 
the window for days 4 to 9 when multiple samples were 
available. Alanine (Table 2) was significantly associated with 
characteristics of the intestinal microbial community. 
Alanine levels were most strongly associated (P < 0.001) 
with Cluster I samples identified in the ordination for days 
4 to 9, including the NEC-I cases, non-NEC deaths and the 
controls in that cluster. Alanine was also directly correlated 
with the relative abundance of Firmicutes (P = 0.009), 
and inversely correlated with the relative abundance 
of both Proteobacteria (P = 0.027) and Propionibacterium 
(P = 0.015). Alanine was not associated with the number of 
days elapsed between collection of the urine sample and 
case onset, and thus appeared to be associated with 
dysbiosis rather than the host response. Pyridoxine was not 
as strongly associated with microbial community charac- 
teristics, and its association was explained by correlation 
with alanine (data not shown). Histidine was not independ- 
ently associated with microbial community characteristics 
(Table 2), but had a strong inverse association with the 
number of days between collection of the urine sample 
and case onset (Spearman's rho = -0.71, P = 0.004). Thus, 
histidine appeared to be associated with the host response 
rather than dysbiosis. Because alanine and histidine were 
observed to have distinct and possibly complementary 
associations in relation to NEC-I and NEC-II (Table 2 and 
Figure 5), we examined the ratio of alanine to histidine in 
relation to NEC outcomes (Figure 5C) and dysbiosis 
(Table 2). Remarkably, the ratio was positively associated 
with overall NEC (Kruskal-Wallis, P = 0.001), was inversely 
associated with the relative abundance of Propionibacterium 
(Spearman's rho = -0.57, P = 0.002), and did not differ by 
NEC sub-type nor in relation to time between sample 
collection and disease onset. 
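
The correlations reported in this section are Spearman rank correlations, that is, Pearson correlations computed on ranks, with tied values sharing their average rank. The sketch below shows the calculation in plain Python; the helper names are hypothetical and the example values are illustrative, not study measurements. It does not compute the accompanying P value.

```python
def average_ranks(values):
    """Rank values from 1 to n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation of the rank-transformed values."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Illustrative values only: days from urine collection to onset vs a metabolite level.
days_to_onset = [3, 5, 8, 12, 20]
metabolite_level = [0.60, 0.55, 0.40, 0.42, 0.20]
rho = spearman_rho(days_to_onset, metabolite_level)  # negative: level falls as days increase
```

Because only ranks enter the calculation, the statistic is unaffected by the normalization scale of the metabolite peak intensities.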

Predictive biomarkers 

To systematically evaluate the signatures identified above as 
potential predictors of NEC onset, we analyzed the area 
under the receiver operating curve (ROC), beginning with 
defined microbial colonization characteristics (Table 3). 
First, we examined high Firmicutes dysbiosis (>98% relative 
abundance in samples from days 4 to 9), which occurred in 
4 of 9 NEC cases versus 0 of 18 controls (P = 0.007) who 
had a sample during days 4 to 9. This measure had a good 
predictive value (72%) and optimal specificity (100%), but 
very poor sensitivity (44%). Next, we examined the absence 
Table 2 Association between urinary metabolite values, microbial community characteristics and NEC^ 

Characteristic                              Alanine        Histidine      Alanine:histidine ratio 

Spearman's rho (P value) 

Firmicutes, relative abundance              +0.49          +0.32          +0.20 
                                            (P = 0.009)    (P = 0.095)    (P = 0.315) 

Proteobacteria, relative abundance          -0.42          -0.24          -0.16 
                                            (P = 0.027)    (P = 0.221)    (P = 0.413) 

Propionibacterium, relative abundance       -0.45          +0.02          -0.57 
                                            (P = 0.015)    (P = 0.920)    (P = 0.002) 

Median values by cluster or NEC status (Kruskal-Wallis P value) 

Cluster I samples versus                    3.34           0.59           4.97 
All other samples                           1.52           0.44           3.46 
                                            (P < 0.001)    (P = 0.140)    (P = 0.029) 

Cluster II samples versus                   1.52           0.44           3.51 
All other samples                           2.46           0.59           4.86 
                                            (P = 0.007)    (P = 0.104)    (P = 0.403) 

NEC samples versus                          2.13           0.33           6.32 
All others excluding non-NEC deaths         1.72           0.53           3.42 
                                            (P = 0.467)    (P = 0.169)    (P = 0.022) 

NEC-I versus                                2.81           0.55           4.86 
All others excluding non-NEC deaths         1.57           0.44           3.51 
                                            (P = 0.023)    (P = 0.394)    (P = 0.177) 

NEC-II versus                               1.50           0.25           7.35 
All others excluding non-NEC deaths         1.84           0.53           3.53 
                                            (P = 0.229)    (P = 0.015)    (P = 0.126) 

^The urine and stool samples were collected from 28 infants between postnatal days 4 to 9. Analyses include 9 NEC cases, 2 deaths and 17 controls, except 
analysis of NEC, NEC-I and NEC-II, for which the non-NEC deaths were removed as a competing cause. Metabolites measured as normalized peak intensity. 
Significant values are bolded. 

of Propionibacterium in samples from days 4 to 9 as a 
potential predictor, which occurred with all NEC cases but 
only 44% of 18 controls (P = 0.009). While the predictive 
value of the absence of Propionibacterium was good (78%) 
for prediction of NEC, with optimal sensitivity (100%), the 
specificity was poor (56%). We then examined high 
Proteobacteria dysbiosis in days 10 to 16 samples as a 
predictor of subsequent NEC. ROC analysis identified two 
cut-points for high Proteobacteria dysbiosis, >90% or >98% 
relative abundance, both of which maximized the predictive 
value (62%), but neither was significant. We selected the 
>90% cut-point, as it included all NEC-II cases identified in 
Cluster II/Cluster A by ordination. We then examined the 
likelihood of developing NEC from having either form of 
dysbiosis. This analysis, limited to infants with samples in 
both time periods, found that all 7 NEC cases versus 8 
(50%) of 16 controls (P = 0.052) had a form of early 
dysbiosis, which increased the predictive value to 75%. But, 

Table 3 Microbial and metabolite predictors of NEC 

Criteria  Description                                                    NEC          Controls     P      Predictive value (c)  Sensitivity  Specificity 

1         High Firmicutes dysbiosis (>98%, days 4 to 9)                  4/9 (44%)    0/18 (0)     0.007  72%                   44%          100% 
2         No Propionibacterium (days 4 to 9)                             9/9 (100%)   8/18 (44%)   0.009  78%                   100%         56% 
3         High Proteobacteria dysbiosis (>90%, days 10 to 16)            6/9 (67%)    8/19 (42%)   0.420  62%                   67%          58% 
4         Dysbiosis (criterion 1 or 3)                                   7/7 (100%)   8/16 (50%)   0.052  75%                   100%         50% 
5         Combined criteria 2 and 4 (no Propionibacterium + dysbiosis)   7/7 (100%)   4/16 (25%)   0.001  88%                   100%         75% 
6         Urinary alanine:histidine ratio >4 (days 4 to 9)               9/11 (82%)   5/20 (25%)   0.007  78%                   82%          75% 

P values based on Fisher's exact test. The variations in denominator are due to varying sample availability. 
the highest predictive value (88%) was obtained from 
a combination of either form of dysbiosis and lack of 
Propionibacterium, which occurred in 7 of 7 NEC cases 
and 4 (25%) of 16 controls (P = 0.001), and thus had ideal 
sensitivity (100%) and good specificity (75%). 
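
The figures reported for each criterion in Table 3 can be recomputed from the 2x2 counts alone. The sketch below is a minimal illustration under two assumptions that the text does not state explicitly: that the "predictive value (c)" of a yes/no criterion is the area under its single-point ROC curve, which equals the mean of sensitivity and specificity, and that the Fisher's exact test is two-sided, summing the probabilities of all tables no more likely than the observed one. The function names are hypothetical; the counts come from Table 3.

```python
from math import comb


def binary_predictor_summary(pos_cases, n_cases, pos_controls, n_controls):
    """Return (sensitivity, specificity, AUC) for a yes/no criterion.

    For a binary predictor the ROC curve has one interior point, so the
    area under it is (sensitivity + specificity) / 2.
    """
    sensitivity = pos_cases / n_cases
    specificity = 1 - pos_controls / n_controls
    return sensitivity, specificity, (sensitivity + specificity) / 2


def fisher_exact_two_sided(pos_cases, n_cases, pos_controls, n_controls):
    """Two-sided Fisher's exact P value for a 2x2 table with fixed margins."""
    col = pos_cases + pos_controls
    lo = max(0, col - n_controls)
    hi = min(n_cases, col)
    # Unnormalized hypergeometric weights, kept as integers so ties compare exactly.
    weights = {x: comb(n_cases, x) * comb(n_controls, col - x) for x in range(lo, hi + 1)}
    observed = weights[pos_cases]
    extreme = sum(w for w in weights.values() if w <= observed)
    return extreme / sum(weights.values())


# Criterion 5 in Table 3: 7 of 7 NEC cases versus 4 of 16 controls.
sens, spec, auc = binary_predictor_summary(7, 7, 4, 16)
p_value = fisher_exact_two_sided(7, 7, 4, 16)
```

Under these assumptions, criterion 5 gives a sensitivity of 1.0, a specificity of 0.75 and an area of 0.875, in line with the reported 100%, 75% and 88%, and the P value rounds to 0.001.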

We then examined urinary metabolites as surrogate 
markers for prediction of NEC. On their own, alanine, pyri- 
doxine and histidine were not significantly associated with 
a risk of overall NEC. However, when analyzed together, 
alanine and histidine were significantly associated with 
NEC (Figure 5C and Table 2). ROC analysis found the 
optimal cut-point to be a urinary alanine:histidine ratio >4 
(Table 3), which yielded a predictive value of 78%. Alanine: 
histidine ratio values above the cut-point occurred in 9 
(82%) of 11 NEC cases and 5 (25%) of 20 controls 
(P = 0.007), providing very good sensitivity (82%) 
and good specificity (75%). 
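
Choosing a cut-point for a continuous marker such as the alanine:histidine ratio amounts to scanning candidate thresholds along the ROC curve. The sketch below uses Youden's J (sensitivity + specificity - 1) as the optimality criterion; this is an assumption, since the text does not say how "optimal" was defined, and the function name and ratio values are hypothetical rather than study data.

```python
def best_cutpoint(case_values, control_values):
    """Return (cutpoint, sensitivity, specificity) maximizing Youden's J.

    A subject is called positive when its value is strictly greater than
    the cutpoint. Candidate cutpoints are the observed values themselves.
    """
    candidates = sorted(set(case_values) | set(control_values))
    best = None
    for cut in candidates:
        sens = sum(v > cut for v in case_values) / len(case_values)
        spec = sum(v <= cut for v in control_values) / len(control_values)
        j = sens + spec - 1
        # Keep the first (lowest) cutpoint on ties.
        if best is None or j > best[0]:
            best = (j, cut, sens, spec)
    return best[1], best[2], best[3]


# Hypothetical alanine:histidine ratios, for illustration only.
case_ratios = [6.3, 7.4, 4.9, 5.2, 3.1]
control_ratios = [3.4, 3.5, 2.8, 4.2, 3.0, 5.0]
cut, sens, spec = best_cutpoint(case_ratios, control_ratios)
```

With so few subjects, a cut-point chosen this way is tuned to the sample at hand, which is one reason the reported threshold needs validation in an independent cohort.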

In summary, the predictive values of variables defining 
aspects of the early intestinal microbiome or urinary 
metabolome ranged from 62% to 88%. Each potentially 
predictive biomarker that we examined was significant 
except for Proteobacteria dysbiosis, which had the lowest 
predictive value (62%). Proteobacteria dysbiosis was sig- 
nificant as a predictive variable only if examined together 
with Firmicutes dysbiosis (75% predictive value). The 
highest predictive value, 88%, obtained from the combin- 
ation of either form of dysbiosis and the absence of Propi- 
onibacterium in the first week of life, suggests the upper 
limit of prediction that may be obtained from analysis of 
the early microbial community composition. Given the 
small sample size, validation is necessary in larger studies 
as well as other populations. Nevertheless, these data pro- 
vide strong initial proof of concept that early microbial 
colonization has a critical role in the development of 
NEC. This perspective is reinforced by finding a urinary 
marker (alanine:histidine ratio >4) from analysis of 
samples from the first week of life, which was associ- 
ated with microbial community composition and had 
a predictive value of 78% for prediction of NEC. 

Discussion 

Intestinal colonization has long been thought to contribute 
to NEC in preterm infants, despite a failure to consistently 
identify a single pathogen or pathogenic microbial commu- 
nity associated with its occurrence. The results of this study 
indicate the need to focus metagenomic research for 
preterm infants on the first few weeks of life. Our study 
revealed several factors that were significantly associated 
with subsequent risk of NEC, specifically, the absence of 
Propionibacterium in the first week and two distinct forms 
of dysbiosis that occurred over the first two weeks of life: 
During days 4 to 9, the microbial community prior to onset 
of early NEC cases consisted predominantly (>98%) of 
Firmicutes, specifically class Bacilli, of which the dominant 



genera were Staphylococcus and Enterococcus. This unique 
Gram-positive microbial signature was not shared by any of 
the control infants. The second clustered microbial 
phenotype occurred during 10 to 16 days of life pre- 
ceding later onset NEC cases, and consisted of a 
Gram-negative Proteobacteria signature, specifically, 
family Enterobacteriaceae, of which the dominant genera 
were Enterobacter and Escherichia. The disparate timing 
and composition of these high Firmicutes and high 
Proteobacteria microbial signatures are intriguing, espe- 
cially given their association with early versus late onset 
NEC or death. We also found maternal antepartum anti- 
biotic use to be significantly higher in NEC-II compared to 
NEC-I cases. While there appeared to be a trend towards 
association of NEC-I with more C-section delivery and 
NEC-II with more vaginal delivery, this was not significant. 
Our data also suggested a trend towards lower alpha diver- 
sity of samples from later NEC cases compared to controls, 
but this was also not statistically significant. However, given 
the limited sample size of this study, we cannot rule out 
differences that might be identified in larger future studies 
of the early microbiome. 

While we are not aware of any previous study that has 
demonstrated two distinct forms of intestinal dysbiosis 
prior to onset of NEC, previous studies support the plausi- 
bility of our findings. Consistent with our finding that dis- 
tinct forms of dysbiosis were composed of Firmicutes and 
Proteobacteria, Koenig et al. [51] reported that these phyla 
are strongly negatively correlated with each other in normal 
infant colonization. A previous study of 18 preterm infants 
by Mai et al. [10] reported that the relative abundance of 
Firmicutes was higher in samples taken one week prior to 
NEC onset compared to controls, followed by higher abun- 
dance of Proteobacteria within 72 hours prior to NEC 
onset. Although the authors did not describe distinct forms 
of dysbiosis for distinct subgroups as reported here, their 
findings nevertheless suggested a key role for both major 
phyla at distinct timings. Finally, Taur et al. reported two 
forms of dysbiosis following antibiotic administration in 
allogeneic hematopoietic transplant patients; consistent 
with our study, one form of dysbiosis was characterized by 
dominance of Firmicutes (Enterococcus and Streptococcus) 
and the other form was characterized by dominance of 
various Proteobacteria [57]. 

Our discovery of two forms of dysbiosis by metagenomic 
analysis was supported by metabolomics, which identified 
differences among urine samples that were collected in the 
first week of life prior to case onset. These two '-omic' 
methods provide complementary information. Urinary 
metabolomics is a sensitive method of identifying groups 
that differ in their intestinal bacterial colonization [25,44]. 
Production and utilization of specific metabolites differ 
among colonizing bacteria, which in turn affects their bio- 
availability to the host [26]. While metagenomic analysis of 
microbial DNA provides a comprehensive snapshot of bacterial 
composition, metabolomic comparison of microbial 
colonization phenotypes provides a snapshot of their differ- 
ential metabolic activity [27-29]. In our study, three urinary 
metabolites (alanine, pyridoxine and histidine) differed sig- 
nificantly between one of the NEC sub-types and controls. 
Urinary alanine was higher in NEC-I compared to controls, 
positively correlated with the relative abundance of 
Firmicutes, and negatively correlated with the relative 
abundance of Proteobacteria and Propionibacterium. 
Alanine is a non-essential amino acid, which is ubiquitously 
incorporated into bacterial cell wall biosynthesis, a potential 
target of immune sensing [58-60]. As peptidoglycan consti- 
tutes most of the dry weight of Gram-positive organisms 
but only a small share of the dry weight of Gram-negative 
organisms, alanine seems a particularly promising candi- 
date to differentiate patients whose microbiome is strongly 
dominated by Gram-positive organisms. Indeed, alanine 
was previously reported to be significantly elevated in the 
feces of irritable bowel syndrome cases compared to 
controls, and positively associated with higher intestinal 
colonization with Gram-positive organisms [27]. Pyridox- 
ine, also elevated in the urine of NEC-I infants versus 
controls, is produced by bacteria in general [61] and may 
reflect bacterial abundance or growth. But pyridoxine, a 
correlate of alanine in our data, did not appear to be inde- 
pendently associated with microbial community compos- 
ition or NEC in the presence of alanine. A proteinogenic 
amino acid, histidine, differed between NEC-II and con- 
trols, but was not associated in our dataset with microbial 
community composition per se. However, an in vitro study 
of the metabolites of microbial growth previously reported 
that histidine was lower in medium with increased growth 
of Escherichia, a genus that contributed to the Proteobacteria 
dysbiosis associated with NEC-II in our study [25]. 
While none of the metabolites were alone predictive of 
overall NEC, the ratio of alanine to histidine was signifi- 
cantly associated with NEC overall as well as with the 
relative abundance of Propionibacterium. 

The preterm infants in our study generally lacked microbiota 
that are known to influence healthy immune development 
and oral tolerance, including Bifidobacterium, 
Bacteroides fragilis and other commensal gut microflora 
[23,24,62]. While the lack of these beneficial organisms is 
characteristic of preterm infants, the primers used in this 
study have been documented to be less than optimal for 
quantitative representation of Bifidobacterium [63]. However, 
our own unpublished data suggest that the presence 
or absence of Bifidobacterium was unlikely to have been 
highly biased. In this study, we detected Bifidobacterium in 
15% of the samples. From the same ongoing cohort of 
preterm infants, we recently analyzed an additional 182 
samples that were amplified using 515F/806R primers and 
sequenced on the MiSeq platform [41] and again found a 



15% Bifidobacterium detection rate. Of the additional 
samples sequenced both by 515F/806R (MiSeq) and 
V3-V5 (454), the same Bifidobacterium-positive samples 
were identified with each method. For one 
Bifidobacterium-positive sample, we generated additional 
data from deep whole genome sequencing (WGS). The estimated 
relative abundances of Bifidobacterium by method 
were 22% by WGS, 14% by 515F/806R (MiSeq) and 2% by 
V3-V5 (454). Taken together, these data suggest that our 
use of V3-V5 primers did not significantly misrepresent the 
presence or absence of Bifidobacterium colonization 
in our dataset, but very likely underrepresented the 
relative abundance of Bifidobacterium when present. 

Propionibacterium, a genus of the phylum Actinobacteria, 
was the only organism that differed significantly between 
all NEC cases and controls in this study. The organism was 
identified in the first postnatal week in about half of the 
controls but none of later NEC cases, suggesting a potential 
commensal role. This genus includes many species and 
strains that are used as probiotics by the dairy industry 
[54]. Other Propionibacterium commonly colonize the skin 
[64] and have been reported in breast milk [65]. These 
organisms are so named due to their production of 
propionic acid as well as other short chain fatty acids 
that have a beneficial role in intestinal health. The 
role of Propionibacterium in the intestinal colonization of 
infants is not known. Nevertheless, our observation sug- 
gests a benefit from initial colonization with this organism, 
and the possibility that other commensals may also benefit 
preterm infants. 

The high Firmicutes dysbiosis that we observed may 
imply excessive exposure to the peptidoglycan that covers 
the surface of Gram-positive organisms. TLR2 recognizes 
peptidoglycan, and exhibits excessive signaling in the immature 
enterocyte [4]. The lack of exposure to LPS-bearing 
Gram-negative organisms in the first week of life may 
impair the development of tolerance in preterm infants, 
resulting in an even higher inflammatory response when 
presented. The Proteobacteria dysbiosis that we observed 
was characterized by high abundance of Enterobacteriaceae 
in the second week of life, consistent with in vivo and 
ex vivo studies indicating that NEC is a hyperinflammatory 
state resulting from excessive TLR4 signaling in response to 
LPS [9-11,13]. However, in our study, high Gram-negative 
predominance occurred in both NEC and control subjects. 
High relative abundance of Proteobacteria may induce 
heightened vulnerability, but the development of NEC may 
require additional insults or vulnerabilities, including later 
exposures to pathogens or oxidative stress [66]. Alterna- 
tively, control infants with similarly high colonization with 
Proteobacteria may be more immunologically tolerant. Our 
finding that no NEC cases but half of the controls had low 
but detectable levels of Propionibacterium, and that the 
presence of this organism appeared to mitigate the risk of 
NEC associated with high levels of Proteobacteria in the 
second week of life, is consistent with the concept that the 
early presence of commensal bacteria helps induce immune 
homeostasis [23]. 

While early colonization might contribute in various 
ways to the causal pathway leading to NEC, our findings 
are consistent with the suggestion that early dysbiosis 
has a time-sensitive role in dysregulation of the develop- 
ing immune response [19,21]. Supporting this putative 
role, a time-series analysis of the intestinal transcripts of 
gnotobiotic and conventionalized mice demonstrated 
that immune genes respond to microbial colonization in 
a temporal sequence that coordinates the development 
of the immune system to achieve homeostasis [20]. For 
example, T-cell maturation and tolerance-associated 
functions, such as IL-10 and Foxp3, are significantly 
increased in conventionalized mice by day eight after 
introduction of microbiota, a timing relevant to our 
findings. However, our study does not address causality. 
Experimental studies are needed of microbial host inter- 
actions and immune development of the immature 
mucosa to understand the potential role of time-specific 
microbial dysbiosis as related to NEC in preterm infants. 

The limitations of this study should also be consid- 
ered. The sample size of this study was modest, and our 
findings require validation in larger, multi-site cohorts. 
Further, we included three non-NEC deaths as a second- 
ary comparison group. The cause of death for the three 
non-NEC deaths was attributed to respiratory distress 
syndrome for two infants and suspected infection for one 
infant. While these deaths may be considered irrelevant to 
NEC, our data suggest similarity in the microbiome of 
NEC and the non-NEC (non-congenital) neonatal deaths 
that occurred in the same postnatal period [7,8,15]. The 
reason for the observed microbial similarity between high 
Firmicutes-associated NEC and non-NEC deaths in this 
study is not known. None of the non-NEC deaths were 
ever suspected of NEC. The infant with suspected infection 
had clinical signs of infection and received antibiotics and 
supportive treatment, but blood cultures were negative. 
Since the pathobiology of NEC remains an enigma, and no 
pathogens were identified that may have contributed to the 
non-NEC deaths, we cannot explain the similarity of the 
microbiome in non-NEC deaths and early NEC cases. 
Nevertheless, these initial findings suggest that immune 
dysregulation secondary to dysbiosis may be common to 
NEC and some non-NEC deaths. 

Conclusions 

Our findings provide important new insights, particularly 
regarding the role of the early microbiome in subsequent 
risk of NEC, including early Gram-positive and Gram- 
negative forms of dysbiosis. Further, the timing of onset of 
NEC tended to differ by the form of dysbiosis. While older 



preterm infants tend to have earlier onset of NEC [3], 
neither infant gestational age nor birth weight explains our 
findings. All study infants were <29 weeks gestation and < 
1,200 g birth weight, and neither variable differed between 
NEC and control infants nor between NEC cases with a mi- 
crobial phenotype characterized as Firmicutes (Bacilli) or 
Proteobacteria (Enterobacteriaceae) dysbiosis. We speculate 
that our findings may be relevant for understanding the 
variation in timing of NEC onset that has been reported by 
NICUs [3]. However, future studies are needed of 
multiple, independent sites to test that hypothesis. 
Regardless, the discovery of different forms of early 
microbial dysbiosis provides a focus for investigation 
of aberrant microbial-mucosal communication as part 
of the pathobiology leading to NEC. 

Our data also indicate that characterization of early 
dysbiosis and the presence or absence of potentially pro- 
biotic organisms, may serve as non-invasive biomarkers, 
which can together predict NEC in preterm infants. Using 
the combination of early microbial factors, we obtained a 
very high predictive value for NEC (88%), but given the 
small sample size of this study, and its conduct in a single 
population, this initial estimate should be taken with 
caution. Rather, the strength of this study is that it 
provides proof of concept that clinically significant 
prediction of NEC may be achieved by directly measuring 
fecal samples, or indirectly through surrogate biomarkers 
such as urinary metabolites during the first weeks of life. 
Further characterization of NEC, and potential sub-types of 
NEC, through -omic approaches and clinical and immuno- 
logic assessments of early postnatal life is needed to 
advance understanding of this complex disease. 